Saturated Transformers are Constant-Depth Threshold Circuits
نویسندگان
چکیده
Abstract Transformers have become a standard neural network architecture for many NLP problems, motivating theoretical analysis of their power in terms formal languages. Recent work has shown that transformers with hard attention are quite limited (Hahn, 2020), as they can be simulated by constant-depth AND/OR circuits (Hao et al., 2022). However, is strong assumption, which may complicate the relevance these results practice. In this work, we analyze circuit complexity saturated attention: generalization more closely captures patterns learnable practical transformers. We first show transcend known limitations hard-attention then prove floating-point values threshold circuits, giving class TC0 an upper bound on languages recognize.
منابع مشابه
Uniform constant-depth threshold circuits for division and iterated multiplication
It has been known since the mid-1980's [15, 46, 47] that integer division can be performed by poly-time uniform constant-depth circuits of Majority gates; equivalently, the division problem lies in P-uniform TC 0. Recently this was improved to L-uniform TC 0 [19], but it remained unknown whether division can be performed by DLOGTIME-uniform TC 0 circuits. The DLOGTIME uniformity condition is re...
متن کاملA Bounded Arithmetic Theory for Constant Depth Threshold Circuits
We deene an extension R 0 2 of the bounded arithmetic theory R 0 2 and show that the class of functions b 1-deenable in R 0 2 coincides with the computational complexity class TC 0 of functions computable by polynomial size, constant depth threshold circuits.
متن کاملOn Small Depth Threshold Circuits
In this talk we will consider various classes defined by small depth polynomial size circuits which contain threshold gates and parity gates. We will describe various inclusions between many classes defined in this way and also classes whose definitions rely upon spectral properties of Boolean functions.
متن کاملLinear-Size Constant-Depth Polylog-Treshold Circuits
We present a simple explicit construction giving unbounded fan-in circuits with $o(n)$ gates and depth $O(r)$ for the threshold function of $n$ variables when the threshold is at most $(log n)^r$, for any integer $r>0$. This improves a result of Atjai and Ben-Or, who showed the existence of circuits of size $n^{O(1)}$. This is the highest threshold for which polynomial-size, constant-depth circ...
متن کاملPseudorandomness for Read-Once, Constant-Depth Circuits
For Boolean functions computed by read-once, depth-D circuits with unbounded fan-in over the de Morgan basis, we present an explicit pseudorandom generator with seed length Õ(log n). The previous best seed length known for this model was Õ(log n), obtained by Trevisan and Xue (CCC ‘13 ) for all of AC (not just read-once). Our work makes use of Fourier analytic techniques for pseudorandomness in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2022
ISSN: ['2307-387X']
DOI: https://doi.org/10.1162/tacl_a_00493